The Estimation and Interpretation of Several
Selectivity Models


Robert  P.  Trost* 


1.  INTRODUCTION 

In recent years there have been a large number of studies that deal
with  the  problem  of  selectivity  bias  in  the  data.  Here  the  term  "selectivity 
bias"  refers  to  non-randomly  distributed  observed  data.  This  non-randomness 
can  occur  whenever  the  data  we  have  are  generated  by  the  choices  that 
individuals  make.  A  review  of  selectivity  problems  in  econometric 
models  can  be  found  in  two  papers  by  Maddala  (1977). 

The  purpose  of  the  present  paper  is  to  review  several  models  not 
discussed  in  Maddala's  (1977)  papers,  and  to  give  a  further  interpretation 
of  the  covariance  terms  that  are  particular  to  selectivity  models.  This 
should  enhance  the  understanding  of  these  models. 


2. FOUR MODELS OF SELECTIVITY

In this section I present four models of selectivity and discuss
the estimation of each. The four models are: (1) measuring the returns
to a college education when almost everyone in the sample is working, and
when the level of college education does not enter directly into the
earnings equations; (2) measuring the returns to a college education when
almost everyone in the sample is working, and when the level of college
education does enter directly into the earnings equations; (3) measuring
the returns to a college education when only part of the sample decides
to work, and the level of college education does not enter directly into the
earnings equations; and (4) measuring the returns to a specific type of
military training. A discussion of these four models follows.

Model 1: A Simple Model for Measuring the Returns to a College Education

The  question  this  first  model  tries  to  answer  is:  Are  expected 
earnings  higher  if  an  individual  completes  at  least  one  year  of  college? 
A  naive  way  to  answer  this  question  is  to  estimate  a  dummy  variable 
regression  equation  where  earnings  are  regressed  on  several  explanatory 
variables  and  a  dummy  variable  which  takes  a  value  of  1  if  the  individual 
goes  to  college  and  0  otherwise.  There  is,  however,  a  self-selectivity 
problem  that  needs  to  be  analyzed  here.  The  self-selectivity  problem  is 
that  some  individuals  choose  to  go  to  college  and  others  do  not.  If 
this  choosing  process  is  based  on  potential  college  versus  non-college 
earnings,  then  the  dummy  variable  in  the  above  naive  approach  should  not, 
in  general,  be  treated  as  exogenous.  Rather,  if  the  individuals  who 
go  to  college  are  the  ones  who  can  most  benefit  by  it,  and  vice  versa 
for  those  who  choose  not  to  go  to  college,  the  dummy  variable  should  be 
treated  as  endogenous.  The  net  impact  of  this  endogenous  choosing 
process  is  that  the  earnings  data  we  do  observe  may  not  be  a  truly 
random sample. Here the term "random sample" refers to the sample of
college and non-college earnings we would have observed if we could
simultaneously measure everybody's earnings both with and without
college. In order to correct for this potential non-randomness in the
observed data, I specify the following model.

    $I^* = \gamma^{*\prime} X - \varepsilon^*$,    (1)

where we only observe:

    $I = 1$ iff $I^* > 0 \iff \varepsilon^* < \gamma^{*\prime} X$
    $I = 0$ otherwise.

Here $I^*$ is an unobservable underlying index whereby an individual chooses
whether or not to go to college, $\varepsilon^* \sim N(0, \sigma^2)$, and $\gamma^*$ is a vector of
parameters to be estimated. If $I^* > 0$, then the individual chooses college,
and if $I^* \leq 0$ he does not go to college. Although we do not observe
the index $I^*$, we do observe the choice of whether or not to go to
college. This observable choice is represented by the dichotomous index
$I = 1$ if $I^* > 0$ and $I = 0$ otherwise.

The model also specifies two earnings equations:

    $E_1 = \beta_1' X_1 + \varepsilon_1$    (college earnings)    (2)

    $E_2 = \beta_2' X_2 + \varepsilon_2$    (w/o college earnings),    (3)

where we only observe:

    $E_1$ iff $I = 1$, and
    $E_2$ iff $I = 0$.

In equations (2) and (3), $\beta_1$ and $\beta_2$ are parameters to be estimated, and
$\varepsilon_1$ and $\varepsilon_2$ are distributed $N(0, \sigma_1^2)$ and $N(0, \sigma_2^2)$, respectively. The model does
not assume that $\varepsilon_1$ and $\varepsilon_2$ are independent of $\varepsilon^*$. The question we want
to answer is: for a given level of the exogenous variables, is the
expected value of $E_1$ greater than, equal to, or less than the expected
value of $E_2$? To answer this question we need to estimate $\beta_1$ and $\beta_2$.
Consider the estimation of $\beta_1$ first.

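If we could observe earnings $E_1$ for everyone, then there would be
no serious estimation problem. We could simply estimate $\beta_1$ by OLS.
Similarly, if the $E_1$ earnings we do observe were randomly selected
from the entire sample, then again OLS would be all right. The problem is,
the observed $E_1$ earnings may not be a random sample. That is, the
expectation of $E_1$ for the sub-sample we do observe may not equal the expectation
of $E_1$ for the entire sample. In fact, if $\varepsilon_1$ is correlated with $\varepsilon^*$, the
observed $E_1$ will not be random because we only observe $E_1$ (and the associated
disturbance $\varepsilon_1$) when $\varepsilon^* < \gamma^{*\prime} X$. It can easily be shown that:

    $E(\varepsilon_1 \mid \varepsilon < \gamma' X) = -\sigma_{1\varepsilon} \dfrac{f(\gamma' X)}{F(\gamma' X)}$

    $E(\varepsilon_2 \mid \varepsilon \geq \gamma' X) = \sigma_{2\varepsilon} \dfrac{f(\gamma' X)}{1 - F(\gamma' X)}$

where $\gamma = \gamma^*/\sigma$, $\varepsilon = \varepsilon^*/\sigma$, $f(\cdot)$ is the standard normal density function, $F(\cdot)$ is the
standard normal cumulative function, and $\sigma_{1\varepsilon}$ is the covariance between
$\varepsilon_1$ and $\varepsilon$ (similarly, $\sigma_{2\varepsilon}$ is the covariance between $\varepsilon_2$ and $\varepsilon$). Proofs of
these two expectations can be found in papers by
Lee (1976) and Heckman (1976).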
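The two conditional expectations can be checked numerically. The following is a minimal sketch, not part of the original paper: it assumes a standardized choice disturbance, builds a disturbance with a chosen covariance, and compares simulated truncated means with $-\sigma_{1\varepsilon} f/F$ and $\sigma_{1\varepsilon} f/(1-F)$. The function names and the simulation design are illustrative assumptions.

```python
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal: pdf is f(.), cdf is F(.)


def college_term(index):
    """-f(index)/F(index), i.e. E(eps | eps < index) for standard normal eps."""
    return -STD_NORMAL.pdf(index) / STD_NORMAL.cdf(index)


def non_college_term(index):
    """f(index)/(1 - F(index)), i.e. E(eps | eps >= index)."""
    return STD_NORMAL.pdf(index) / (1.0 - STD_NORMAL.cdf(index))


def simulate_truncated_means(cov, index, n=200_000, seed=0):
    """Simulate the mean of an earnings disturbance on each side of the index.

    The disturbance is cov * eps + independent noise, so its covariance
    with eps equals cov (an assumption of this sketch).
    """
    rng = random.Random(seed)
    sums = {True: 0.0, False: 0.0}
    counts = {True: 0, False: 0}
    for _ in range(n):
        eps = rng.gauss(0.0, 1.0)
        disturbance = cov * eps + rng.gauss(0.0, 1.0)
        chooses_college = eps < index  # I = 1 iff eps < gamma'X
        sums[chooses_college] += disturbance
        counts[chooses_college] += 1
    return sums[True] / counts[True], sums[False] / counts[False]


if __name__ == "__main__":
    cov, index = 0.5, 0.3
    mean_college, mean_non_college = simulate_truncated_means(cov, index)
    print("simulated:", mean_college, mean_non_college)
    print("formula:  ", cov * college_term(index), cov * non_college_term(index))
```

With a positive covariance the simulated mean is negative on the college side and positive on the non-college side, which is the sign pattern used later when interpreting the covariances.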

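To estimate $\beta_1$ and $\beta_2$ by a least squares procedure, Lee (1976) and
Heckman (1976) suggest a two stage procedure. First, estimate $\gamma$ with a
probit model. Second, estimate by least squares the following two equations:

    $E_1 = \beta_1' X_1 - \sigma_{1\varepsilon} \dfrac{f(\hat\gamma' X)}{F(\hat\gamma' X)} + \eta_1$    (2a)

    $E_2 = \beta_2' X_2 + \sigma_{2\varepsilon} \dfrac{f(\hat\gamma' X)}{1 - F(\hat\gamma' X)} + \eta_2$    (3a)

where $\hat\gamma$ is the consistent probit estimate of $\gamma$, and $\eta_1$ and $\eta_2$ are
disturbances with zero mean. Least squares on (2a) and (3a) will yield consistent,
albeit inefficient, estimates of $\beta_1$, $\beta_2$, $\sigma_{1\varepsilon}$ and $\sigma_{2\varepsilon}$. Efficient
estimates of all the parameters can be obtained by maximizing the likelihood function

    $L = \prod_{\text{obs.}} \left[ \int_{-\infty}^{\gamma' X} g(E_1 - \beta_1' X_1, \varepsilon)\, d\varepsilon \right]^{I} \left[ \int_{\gamma' X}^{\infty} g(E_2 - \beta_2' X_2, \varepsilon)\, d\varepsilon \right]^{1-I}$,

where $g(\cdot, \cdot)$ is the bivariate normal density function.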
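The two stage procedure can be illustrated on simulated data. This is a sketch under stated assumptions, not the code used for the estimates reported later: the data-generating values (index $0.2 + z$, earnings equation $1.0 + 0.5x$, covariance $0.4$), the helper names, and the choice of Fisher scoring for the probit step are all illustrative. It covers the college equation (2a) only.

```python
import random
from statistics import NormalDist

N01 = NormalDist()


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def cross_products(rows, weights, targets):
    """Return sum_i w_i x_i x_i' and sum_i x_i t_i."""
    k = len(rows[0])
    xwx = [[0.0] * k for _ in range(k)]
    xt = [0.0] * k
    for row, w, t in zip(rows, weights, targets):
        for i in range(k):
            xt[i] += row[i] * t
            for j in range(k):
                xwx[i][j] += w * row[i] * row[j]
    return xwx, xt


def ols(rows, y):
    """Ordinary least squares via the normal equations."""
    xtx, xty = cross_products(rows, [1.0] * len(y), y)
    return solve(xtx, xty)


def probit(rows, choice, iterations=25):
    """Stage 1: probit maximum likelihood by Fisher scoring."""
    gamma = [0.0] * len(rows[0])
    for _ in range(iterations):
        weights, scores = [], []
        for row, d in zip(rows, choice):
            index = sum(g * v for g, v in zip(gamma, row))
            cdf = min(max(N01.cdf(index), 1e-10), 1.0 - 1e-10)
            pdf = N01.pdf(index)
            weights.append(pdf * pdf / (cdf * (1.0 - cdf)))
            scores.append(pdf * (d - cdf) / (cdf * (1.0 - cdf)))
        info, score = cross_products(rows, weights, scores)
        gamma = [g + s for g, s in zip(gamma, solve(info, score))]
    return gamma


def simulate(n=10_000, seed=1):
    """Hypothetical data: I = 1 iff eps < 0.2 + z; E1 = 1.0 + 0.5 x + eps_1."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        z, x, eps = rng.gauss(0, 1), rng.gauss(0, 1), rng.gauss(0, 1)
        eps_1 = 0.4 * eps + 0.5 * rng.gauss(0, 1)  # Cov(eps_1, eps) = 0.4
        data.append((z, x, 1 if eps < 0.2 + z else 0, 1.0 + 0.5 * x + eps_1))
    return data


def two_stage(data):
    """Stage 2: OLS of E1 on X1 and -f/F for the college sub-sample."""
    gamma = probit([[1.0, z] for z, _, _, _ in data], [d for _, _, d, _ in data])
    rows, y = [], []
    for z, x, college, e_1 in data:
        if college:
            index = gamma[0] + gamma[1] * z
            rows.append([1.0, x, -N01.pdf(index) / N01.cdf(index)])
            y.append(e_1)
    return gamma, ols(rows, y)


if __name__ == "__main__":
    gamma_hat, beta_hat = two_stage(simulate())
    print("probit estimates:", gamma_hat)
    print("constant, slope, covariance:", beta_hat)
```

The coefficient on the $-f/F$ regressor estimates $\sigma_{1\varepsilon}$. As the note to Table 1B warns, conventional second-stage t-values overstate the true asymptotic ones, so this sketch reports point estimates only.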

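Before going on to the second model, it will be useful to look
closer at the covariances $\sigma_{1\varepsilon}$ and $\sigma_{2\varepsilon}$. More specifically, we should
ask: What is the meaning of a negative (or positive) $\sigma_{1\varepsilon}$ or $\sigma_{2\varepsilon}$, and
what is a meaningful relationship between the two covariances?

There are eight possible outcomes for $\sigma_{1\varepsilon}$ and $\sigma_{2\varepsilon}$:

(1) $\sigma_{1\varepsilon} > 0$, $\sigma_{2\varepsilon} > 0$ and $\sigma_{1\varepsilon} > \sigma_{2\varepsilon}$;
(2) $\sigma_{1\varepsilon} > 0$, $\sigma_{2\varepsilon} > 0$ and $\sigma_{1\varepsilon} < \sigma_{2\varepsilon}$;
(3) $\sigma_{1\varepsilon} < 0$, $\sigma_{2\varepsilon} < 0$ and $\sigma_{1\varepsilon} > \sigma_{2\varepsilon}$;
(4) $\sigma_{1\varepsilon} < 0$, $\sigma_{2\varepsilon} < 0$ and $\sigma_{1\varepsilon} < \sigma_{2\varepsilon}$;
(5) $\sigma_{1\varepsilon} < 0$ and $\sigma_{2\varepsilon} > 0$;
(6) $\sigma_{1\varepsilon} > 0$ and $\sigma_{2\varepsilon} < 0$;
(7) $\sigma_{1\varepsilon} > 0$, $\sigma_{2\varepsilon} > 0$ and $\sigma_{1\varepsilon} = \sigma_{2\varepsilon}$; and
(8) $\sigma_{1\varepsilon} < 0$, $\sigma_{2\varepsilon} < 0$ and $\sigma_{1\varepsilon} = \sigma_{2\varepsilon}$.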

Before deciding if any of these eight outcomes make sense, we need
to understand what positive and negative covariances imply in terms of
equations (1) - (3).

Recall that the choice equation was written with a minus sign in front of
the disturbance term. Because of this, a positive sign for the covariance $\sigma_{1\varepsilon}$
means that individuals who are not expected to choose college (i.e., $\gamma' X < 0$)
but do (because $\varepsilon < 0$), have on average lower than expected earnings in
the college wage equation (because $\varepsilon_1$ will tend to be negative for this
group). In terms of the conditional means, a positive $\sigma_{1\varepsilon}$ means:

    $E(E_1 \mid I = 1) < \beta_1' X_1$, and therefore

    $E(E_1 \mid I = 0) > \beta_1' X_1$.

A positive $\sigma_{2\varepsilon}$ means that individuals who are expected to choose
college (i.e., $\gamma' X > 0$) but do not (because $\varepsilon > 0$), have on average
higher than expected earnings in the non-college wage equation (because $\varepsilon_2$
will tend to be positive for this group). In terms of conditional means,
a positive $\sigma_{2\varepsilon}$ means:

    $E(E_2 \mid I = 0) > \beta_2' X_2$, and therefore

    $E(E_2 \mid I = 1) < \beta_2' X_2$.

So a positive $\sigma_{1\varepsilon}$ means that if both the non-college and college
groups go to college, the non-college group would earn more than the
college group. Similarly, a positive $\sigma_{2\varepsilon}$ means that if neither group
goes to college, the non-college group would again earn more than the
college group. So in the case of positive covariances, the non-college
group dominates the college group in both earnings equations.
Before we decide whether or not these relationships make sense, consider
the following interpretation for negative covariances.

A negative $\sigma_{1\varepsilon}$ means that individuals who are not expected to
choose college but do, have on average higher than expected earnings in
the college wage equation. That is, a negative $\sigma_{1\varepsilon}$ implies:

    $E(E_1 \mid I = 1) > \beta_1' X_1$

    $E(E_1 \mid I = 0) < \beta_1' X_1$.

A negative $\sigma_{2\varepsilon}$ means that individuals who are expected to choose
college but do not, have on average lower than expected earnings in the
non-college wage equation. That is, a negative $\sigma_{2\varepsilon}$ implies:

    $E(E_2 \mid I = 0) < \beta_2' X_2$, and therefore

    $E(E_2 \mid I = 1) > \beta_2' X_2$.

So in the case of negative covariances, the college group dominates
the non-college group in both earnings equations.

At first glance, it might appear that only outcome 5 ($\sigma_{1\varepsilon} < 0$ and $\sigma_{2\varepsilon} > 0$)
makes intuitive sense. That is, one might expect the college group to have
a higher college wage (relative to the non-college group's college wage),
and the non-college group to have a higher non-college wage (relative to
the college group's non-college wage). I suppose another reasonable
guess might be that outcomes 3 and 4, where both $\sigma_{1\varepsilon}$ and $\sigma_{2\varepsilon}$ are
negative, make intuitive sense. That is, the college group dominates
the non-college group in both wage equations. Perhaps the least appealing
outcomes that initially come to mind are outcomes 1, 2 and 6. Outcomes
1 and 2 say that the non-college group dominates the college group in
both wage equations. Outcome 6 says that the non-college group dominates
the college wage equation and the college group dominates the non-college
wage equation.

However, initial guesses are sometimes wrong. In actuality, outcomes
2, 4 and 5 are plausible and outcomes 1, 3 and 6 are not. To see this, one
only needs to look at Ricardo's theory of comparative advantage.
Consider outcome 2. In this case the college group earns less than the
non-college group in both wage equations. However, since $\sigma_{1\varepsilon} < \sigma_{2\varepsilon}$, the college
group has a comparative advantage in the college wage equation. That is, while
the college group has lower than mean earnings ($\beta_1' X_1$ and $\beta_2' X_2$) in both
equations, the difference between mean and actual earnings is smaller in the
college wage equation than in the non-college wage equation. So if individuals
base the college decision on potential earnings and $\beta_1' X_1 = \beta_2' X_2$, then on average
$\beta_1' X_1 + \varepsilon_1 > \beta_2' X_2 + \varepsilon_2$ for the college group. This is because $\sigma_{1\varepsilon} < \sigma_{2\varepsilon}$ implies

    $E(E_1 \mid I = 1) > E(E_2 \mid I = 1)$,

in the case where $\beta_1' X_1 = \beta_2' X_2$. A similar argument can be made if $\beta_1' X_1 \neq \beta_2' X_2$.

In other words, a model that says individuals choose college if

    $E_1 > E_2$

implies that $\sigma_{1\varepsilon} < \sigma_{2\varepsilon}$ is the most sensible outcome. Similar arguments concerning
the logical consistency of outcomes 4 and 5, and the inconsistency of outcomes
1, 3 and 6 can be made. So long as the only criterion for choosing college is
relative earnings in the two wage equations, the necessary condition for
consistency of the model is that $\sigma_{1\varepsilon} < \sigma_{2\varepsilon}$. Of course, if the individual looks
at intangibles other than earnings when deciding whether or not to go to
college, any of the 6 outcomes mentioned could be consistent with the model.
But even under the most general specification of the choice equation, so long
as relative earnings enters into the choice equation, the most plausible result
is that $\sigma_{1\varepsilon} < \sigma_{2\varepsilon}$. Finally, outcomes 7 and 8 neither refute nor support a priori
expectations. Since they are in essence "neutral outcomes," nothing needs to
be said about their consistency with the underlying model.
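The classification above can be restated compactly. The sketch below is illustrative only: the function name is hypothetical, and the treatment of a covariance that is exactly zero (which falls outside the eight outcomes) is an assumption of the sketch.

```python
def classify_outcome(cov_1e, cov_2e):
    """Map a pair of covariances to one of the eight outcomes in the text.

    Returns (outcome, consistent). `outcome` is None if either covariance
    is exactly zero, a case the eight outcomes do not cover. `consistent`
    is True when cov_1e < cov_2e, the condition implied when college
    choice depends on relative earnings.
    """
    consistent = cov_1e < cov_2e
    if cov_1e > 0 and cov_2e > 0:
        outcome = 1 if cov_1e > cov_2e else 2 if cov_1e < cov_2e else 7
    elif cov_1e < 0 and cov_2e < 0:
        outcome = 3 if cov_1e > cov_2e else 4 if cov_1e < cov_2e else 8
    elif cov_1e < 0 and cov_2e > 0:
        outcome = 5
    elif cov_1e > 0 and cov_2e < 0:
        outcome = 6
    else:
        outcome = None
    return outcome, consistent


if __name__ == "__main__":
    # Coefficients on the truncation variables reported in Table 1B.
    print(classify_outcome(0.0297, 0.1868))
```

Applied to the two truncation-variable coefficients in Table 1B (.0297 and .1868), the function returns outcome 2, one of the three outcomes the argument above treats as plausible.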

Model 2: A Selectivity Model of Education and Earnings When the Amount of College Enters Simultaneously Into the Model

In this model, let $S^*$ denote the "desired" years of college education and
$S$ the actual years of college education. $E_1$ and $E_2$ are as defined in model 1.
The model now becomes:

    $S^* = \gamma^{*\prime} X - \varepsilon^*$    (1)

    $E_1 = \alpha S + \beta_1' X_1 + \varepsilon_1$    (2)

    $E_2 = \beta_2' X_2 + \varepsilon_2$    (3)

where we only observe:

    $S = S^* = \gamma^{*\prime} X - \varepsilon^*$ and $E_1 = \alpha S + \beta_1' X_1 + \varepsilon_1$    iff $S^* > 0$,

and

    $S = 0$ and $E_2 = \beta_2' X_2 + \varepsilon_2$    iff $S^* \leq 0$.

The assumptions about $\varepsilon^*$, $\varepsilon_1$ and $\varepsilon_2$ are the same as in model 1.

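Model 2 is a simultaneous equations model with selectivity. Estimation
of model 2 is discussed in detail in Kenny et al. (1978), and Lee, Maddala
and Trost (1977). Briefly, what we do is estimate $\gamma^*$ and $\sigma^2$ (= var. of $\varepsilon^*$) in
equation 1 with a Tobit program and get $\hat S = \hat\gamma^{*\prime} X$. We then estimate the following
two stage regression equations:

    $E_1 = \alpha \hat S + \beta_1' X_1 - \sigma_{1\varepsilon} \dfrac{f(\hat\gamma' X)}{F(\hat\gamma' X)} + \eta_1$    (2a)

    $E_2 = \beta_2' X_2 + \sigma_{2\varepsilon} \dfrac{f(\hat\gamma' X)}{1 - F(\hat\gamma' X)} + \eta_2$    (3a)

where $\hat\gamma = \hat\gamma^*/\hat\sigma$, and $\eta_1$ and $\eta_2$ have zero means. The covariance terms $\sigma_{1\varepsilon}$ and $\sigma_{2\varepsilon}$
have the same interpretation as before.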

The main difference between models 1 and 2 is that the choice equation
in model 2 is a Tobit type equation rather than a probit type equation, and
years of college enters directly in model 2's college wage equation. Other
than these two differences, the estimation and interpretation of $\sigma_{1\varepsilon}$ and $\sigma_{2\varepsilon}$
are the same in model 2 as in model 1.

Maximum likelihood estimates of $\gamma$, $\beta_1$, $\beta_2$, $\alpha$ and the error variances and covariances
in model 2 can be obtained by maximizing the following likelihood function:


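    $L = \prod_{S > 0} g(\varepsilon^*, \varepsilon_1) \cdot \prod_{S = 0} \int_{\gamma^{*\prime} X}^{\infty} g(\varepsilon^*, \varepsilon_2)\, d\varepsilon^*$,

where $g(\cdot, \cdot)$ is the bivariate normal density function.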


Model 3: A Model of Labor Force Participation, Wages and the Returns to College

The purpose of models 1 and 2 was to measure the effect of a college
education on earnings, taking into account possible selectivity bias in the
observed college and non-college earnings data. Those models assumed, however,
that everyone in the sample worked. If this assumption is not valid, a
different model and estimation technique are necessary. A discussion of this
alternative model follows.

Model  3  extends  model  1  by  incorporating  the  decision  of  whether  or  not 
to  work  into  the  model.  By  specifying  the  earnings  model  in  this  fashion,  it 
becomes  apparent  that  observed  earnings  may  be  subject  to  two  types  of 
self-selectivity.  First,  the  college  versus  non-college  earnings  data  may  be 
subject  to  the  type  of  self-selectivity  bias  discussed  in  models  1  and  2.  Second, 
since  we  only  observe  the  wages  of  individuals  who  choose  to  work,  the  wage  data 
may  also  be  subject  to  the  type  of  self-selectivity  bias  discussed  in  Heckman 
(1974) and Nelson (1977). Thus, model 3 is:

    $I_c^* = \beta_c^{*\prime} X_c - \varepsilon_c^*$    (1a)

    $I_p^* = \beta_p^{*\prime} X_p - \varepsilon_p^*$    (1b)

where $(\varepsilon_c^*, \varepsilon_p^*)$ is distributed bivariate normal with correlation $\rho_{pc}$, and we
only observe:

    $I_c = 1$ iff $I_c^* \geq 0$    (Choose college in year t)
    $I_c = 0$ otherwise

and

    $I_p = 1$ iff $I_p^* \geq 0$    (Choose to work in year t + n)
    $I_p = 0$ otherwise.

Here $I_c^*$ is an unobservable index whereby an individual decides whether or not
to go to college, and $I_p^*$ is an unobservable index whereby an individual decides
whether or not to work. Model 3 also specifies two earnings equations:


Rixi  f  ri 

(college) 

(2) 

B2X2  +  C2' 

(w/o  college) 

(3) 

where,  ,  E^,  ,  P  ^  are  previously  defined,  and  e ^  are  not  necessarily 

independent  of  r *  and  e  *  ,  and  we  only  observe: 
c  p 


iff 


1 


I  =1  and  I  = 
c  l> 


E. 


and 


iff  1=0  and  I  =  1 . 
c  P 


I  could  also  specify  two  shadow  wage  equations,  but  this  point  will  be  discussed 
elsewhere  in  a  paper  by  Fishe,  Trost  and  Lurie  (1979),  and  is  also  discussed 
in  Nelson  ( 1977) . 

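If $\varepsilon_c^*$ and $\varepsilon_p^*$ are independent, $\beta_1$ and $\beta_2$ can be estimated from:

    $E_1 = \beta_1' X_1 - \sigma_{1\varepsilon_p} \dfrac{f(\hat\beta_p' X_p)}{F(\hat\beta_p' X_p)} - \sigma_{1\varepsilon_c} \dfrac{f(\hat\beta_c' X_c)}{F(\hat\beta_c' X_c)} + \eta_1$    (2a)

    $E_2 = \beta_2' X_2 - \sigma_{2\varepsilon_p} \dfrac{f(\hat\beta_p' X_p)}{F(\hat\beta_p' X_p)} + \sigma_{2\varepsilon_c} \dfrac{f(\hat\beta_c' X_c)}{1 - F(\hat\beta_c' X_c)} + \eta_2$    (3a)

where the $\sigma_{ij}$'s are covariances, $\hat\beta_c$ and $\hat\beta_p$ are probit estimates of (1a) and
(1b), respectively, and $\eta_1$ and $\eta_2$ have zero means. If $\varepsilon_c^*$ and $\varepsilon_p^*$ are not
independent the estimation approach is similar, but (2a) and (3a) are messier,
and $\beta_p$ and $\beta_c$ have to be estimated with a bivariate probit model. Details will be
given in Fishe, Trost and Lurie (1979).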


The expected relationship between $\sigma_{1\varepsilon_c}$ and $\sigma_{2\varepsilon_c}$ has already been discussed
in models 1 and 2. That is, the model only makes sense if $\sigma_{2\varepsilon_c} > \sigma_{1\varepsilon_c}$. The
expected relationships for $\sigma_{1\varepsilon_p}$ and $\sigma_{2\varepsilon_p}$ can only be discussed in the context
of shadow wage equations, which are not present in model 3.


Model  4:  The  Effect  of  Military  Training  on  Civilian  Earnings 

The purpose of this model is to measure the returns to military occupational
training in electronics. A naive procedure is to estimate a dummy variable
regression equation where civilian earnings are regressed on several explanatory
variables and a set of dummy variables that depend on whether or not a veteran
receives military training in electronics, and/or a veteran takes a civilian
job in electronics after leaving the service.

There  are  two  selectivity  problems  that  flaw  the  naive  approach.  The 
first  selectivity  problem  is  that  some  veterans  choose  to  work  in  related  jobs 
while  others  do  not.  Although  the  usual  procedure  treats  occupational  choice 
as  exogenous,  it  is  really  endogenous.  The  observed  earnings  differences 
between  those  who  choose  related  jobs  and  those  who  choose  unrelated  jobs  will 
not,  in  general,  give  unbiased  estimates  of  the  earnings  effect  of  training. 

A second selectivity problem also complicates the analysis of veterans'
earnings. This is the occupational assignment process that occurs upon
entering military service. New entrants are assigned to various military
occupations on the basis of observable educational background and upon their
preferences and other unobservable factors such as occupations available when
they entered service. This occupational selection process may also distort
post-service earnings comparisons.

The model is:

    $YM^* = \beta_m^{*\prime} X_m - \varepsilon_m^*$    (1a)

    $YC^* = \beta_c^{*\prime} X_c + \alpha YM - \varepsilon_c^*$    (1b)

Here $YC^*$ is an unobservable index whereby a veteran chooses whether to work in
electronics after leaving the military, $YM^*$ is an unobservable index whereby
an individual is chosen for military electronics training, $\varepsilon_m^*$ and $\varepsilon_c^*$ are
distributed bivariate normal, and we only observe:

    $YM = 1$ iff $YM^* > 0$
    $YM = 0$ otherwise

and

    $YC = 1$ iff $YC^* > 0$
    $YC = 0$ otherwise.

I also specify four earnings equations:

    $Y_1 = \beta_1' X_1 + \varepsilon_1$    (2)

    $Y_2 = \beta_2' X_2 + \varepsilon_2$    (3)

    $Y_3 = \beta_3' X_3 + \varepsilon_3$    (4)

    $Y_4 = \beta_4' X_4 + \varepsilon_4$    (5)

where $\varepsilon_1$ to $\varepsilon_4$ are normally distributed and are not necessarily independent
of $\varepsilon_m^*$ and $\varepsilon_c^*$; and we only observe

    $Y_1$ iff $YM = 1$; $YC = 1$
    $Y_2$ iff $YM = 0$; $YC = 1$
    $Y_3$ iff $YM = 1$; $YC = 0$
    $Y_4$ iff $YM = 0$; $YC = 0$.

Here $Y_1$ is civilian earnings if the veteran receives military training in
electronics ($YM = 1$) and chooses to take a civilian job in electronics after
leaving the service ($YC = 1$). $Y_2$ to $Y_4$ are civilian earnings defined similarly. A
complete explanation of the model is given in Trost and Warner (1978).

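If $\varepsilon_m^*$ and $\varepsilon_c^*$ are independent, we can estimate $\beta_1$ to $\beta_4$ and the covariances
$\sigma_{ij}$ from:

    $Y_1 = \beta_1' X_1 - \sigma_{1\varepsilon_c} \dfrac{f(\hat\beta_c' X_c + \hat\alpha YM)}{F(\hat\beta_c' X_c + \hat\alpha YM)} - \sigma_{1\varepsilon_m} \dfrac{f(\hat\beta_m' X_m)}{F(\hat\beta_m' X_m)} + \eta_1$    (2a)

    $Y_2 = \beta_2' X_2 - \sigma_{2\varepsilon_c} \dfrac{f(\hat\beta_c' X_c + \hat\alpha YM)}{F(\hat\beta_c' X_c + \hat\alpha YM)} + \sigma_{2\varepsilon_m} \dfrac{f(\hat\beta_m' X_m)}{1 - F(\hat\beta_m' X_m)} + \eta_2$    (3a)

    $Y_3 = \beta_3' X_3 + \sigma_{3\varepsilon_c} \dfrac{f(\hat\beta_c' X_c + \hat\alpha YM)}{1 - F(\hat\beta_c' X_c + \hat\alpha YM)} - \sigma_{3\varepsilon_m} \dfrac{f(\hat\beta_m' X_m)}{F(\hat\beta_m' X_m)} + \eta_3$    (4a)

    $Y_4 = \beta_4' X_4 + \sigma_{4\varepsilon_c} \dfrac{f(\hat\beta_c' X_c + \hat\alpha YM)}{1 - F(\hat\beta_c' X_c + \hat\alpha YM)} + \sigma_{4\varepsilon_m} \dfrac{f(\hat\beta_m' X_m)}{1 - F(\hat\beta_m' X_m)} + \eta_4$    (5a)

In equations 2a to 5a, $\hat\beta_c$, $\hat\alpha$ and $\hat\beta_m$ are probit estimates of the two choice
equations, the $\sigma_{ij}$'s are covariances, and $\eta_1$ to $\eta_4$ are disturbances with
zero means.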
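The sign pattern of the correction terms in (2a) to (5a) follows one rule: a choice that was made contributes $-f/F$, and a choice that was not made contributes $f/(1-F)$. A minimal sketch of that bookkeeping follows; the function names are hypothetical and the two index values are assumed to come from first-stage probit fits.

```python
from statistics import NormalDist

N01 = NormalDist()


def observed_equation(ym, yc):
    """Return j such that Y_j is the earnings variable observed for (YM, YC)."""
    return {(1, 1): 1, (0, 1): 2, (1, 0): 3, (0, 0): 4}[(ym, yc)]


def correction_term(index, chosen):
    """-f/F if the option was chosen, f/(1 - F) if it was not."""
    if chosen:
        return -N01.pdf(index) / N01.cdf(index)
    return N01.pdf(index) / (1.0 - N01.cdf(index))


def selectivity_regressors(index_c, index_m, yc, ym):
    """Regressors whose coefficients are the eps_c and eps_m covariances.

    index_c stands for the fitted civilian-job index (including the
    alpha * YM term) and index_m for the fitted military-training index.
    """
    return correction_term(index_c, yc == 1), correction_term(index_m, ym == 1)


if __name__ == "__main__":
    for ym, yc in [(1, 1), (0, 1), (1, 0), (0, 0)]:
        terms = selectivity_regressors(0.5, -0.5, yc=yc, ym=ym)
        print(observed_equation(ym, yc), terms)
```

The same rule yields the two regressors of model 3, where working is always the chosen option and only the college term changes sign between (2a) and (3a).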

If veterans look at relative earnings when choosing civilian employment,
then we should expect $\sigma_{3\varepsilon_c} > \sigma_{1\varepsilon_c}$ and $\sigma_{4\varepsilon_c} > \sigma_{2\varepsilon_c}$. Also, if the military only
gives electronics training to men who have a comparative advantage in gaining
from the training, then we would expect $\sigma_{2\varepsilon_m} > \sigma_{1\varepsilon_m}$ and $\sigma_{4\varepsilon_m} > \sigma_{3\varepsilon_m}$. We
might also expect $\sigma_{1\varepsilon_m}$ to $\sigma_{4\varepsilon_m}$ to be negative.

Finally, if $\varepsilon_m^*$ and $\varepsilon_c^*$ are not independent, the model can still be estimated
but equations 2a to 5a are more complicated. Details will be given in Fishe,
Trost and Lurie (1979).


EMPIRICAL RESULTS

Estimates of models 1 to 4 are given in Tables 1a to 4g, and Table 5
defines the variables. Tables 1a to 1c contain estimates of Model 1. The
estimates are based on the Parnes data for young men. Since almost all the
men in the sample have jobs, model 1 is the appropriate model. Details of
the estimates are given in a paper by Fishe, Trost and Lurie (1979).

Tables 2a to 2d contain estimates of Model 2. The estimates are based
on the Project Talent data for young men. Again, almost all the men in the
sample had jobs. Details of the estimates are given in Kenny et al. (1979)
and Lee, Maddala and Trost (1977).

Tables 3a to 3d contain estimates of Model 3. These estimates are based
on the Parnes data for young women. Since only about one-half of the women
in the sample had jobs in the year we used to estimate the model, Model 3 is
the appropriate model. Details of these estimates will appear in Fishe, Trost
and Lurie (1979).

Tables 4a to 4g contain estimates of Model 4. These estimates are based
on the earnings records of 11,941 veterans. Details of these estimates are
given in Trost and Warner (1978).

While a complete discussion of these tables would be too lengthy to give
here, it may be useful to see if the estimated covariances have the expected
relationships discussed in section 2. Table 6 gives the estimated covariances
for models 1 to 4. The expected relationships were: the covariances in
col. (2) should be greater than the covariances in col. (1). In general, this
expected relationship holds. So the estimates of Models 1 to 4 are consistent
with the proposed hypotheses presented in section 2.

CONCLUSION 

In this paper I review four models of selectivity and give an interpretation
of the covariance terms that are particular to selectivity models. While some
of the models I examine in this paper have been discussed elsewhere, the
hypothesis I present about the expected relationship between the covariances is new.
In general, I found that the estimates of the four models support my hypothesis.


TABLE 1A

Probit Estimates of College Decision*
for Young Men (Parnes Data)

Variable                         Estimates**

Constant                         -6.4881   (20.52)
DBLACK                             .3768    (3.42)
IQ                                 .0478   (17.38)
DSOUTH                             .16      (2.22)
EDFEM                              .136    (10.41)

Number of Observations           1863
Number of 1's (college)           906
Number of 0's (w/o college)       957
-2 x Log (likelihood ratio)       605.65
DF (Degrees of Freedom)             4

*Dependent Variable = 1 if individual goes to college
                    = 0 otherwise
**t-values are in parentheses

TABLE 1B

Two Stage and OLS Results*: Young Men (Parnes Data)

                         College Estimates**              w/o College Estimates**
Variables                Two Stage        OLS             Two Stage        OLS

DSOUTH                   -.0028  (.09)    -.0005  (.01)   -.1923 (7.05)    -.1885 (6.89)
DBLACK                   -.0854 (1.56)    -.0842 (1.54)   -.1098 (2.89)    -.0826 (2.24)
AGE                       .0619 (11.98)    .062  (12.00)   .0286 (6.82)     .0273 (6.53)
IQ                        .002   (.89)     .0028 (2.40)   -.0002  (.01)     .0044 (4.33)
DMARRIED                  .1946 (6.08)     .1941 (6.07)    .1379 (4.73)     .1387 (4.74)
Constant                 -.3267           -.43495          .6014            .3056
Truncation Variable
  -f/F                    .0297  (.42)
Truncation Variable
  f/(1-F)                                                  .1868 (2.83)

# of Obs.                 904              904             957              957
Adjusted R-squared        .2229            .2236           .1793            .1732
Std Error                 .43174           .43154          .38252           .38393

*  Dependent Variable = Ln (wage)
** t-values are in parentheses. For the two stage estimates
   the t-values overstate the true asymptotic t's.


Table 1C

Mean Wages for Young Men (Parnes Data)

                                    college     w/o college    Difference

Raw Data                            $5.55       $4.58          $.97
Predicted* from OLS Estimates       $4.29       $4.29          $0.0
Predicted* from Two Stage Estimates $4.40       $3.76          $.64

*Predictions are based on Age = 26, IQ = 100, North, white and single.
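The predicted wages in Table 1C can be reproduced from the Table 1B coefficients. The sketch below assumes that each prediction is the exponential of the fitted log wage at Age = 26 and IQ = 100 with all dummies set to zero, and that the two stage predictions omit the truncation term. That reading is an assumption, although it matches the printed figures to the cent.

```python
import math

# Constant, AGE and IQ coefficients copied from Table 1B.
TABLE_1B = {
    ("college", "two_stage"): {"constant": -0.3267, "AGE": 0.0619, "IQ": 0.002},
    ("college", "ols"): {"constant": -0.43495, "AGE": 0.062, "IQ": 0.0028},
    ("no_college", "two_stage"): {"constant": 0.6014, "AGE": 0.0286, "IQ": -0.0002},
    ("no_college", "ols"): {"constant": 0.3056, "AGE": 0.0273, "IQ": 0.0044},
}


def predicted_wage(group, method, age=26, iq=100):
    """exp of the fitted log wage; North, white and single set all dummies to 0."""
    coef = TABLE_1B[(group, method)]
    log_wage = coef["constant"] + coef["AGE"] * age + coef["IQ"] * iq
    return math.exp(log_wage)


if __name__ == "__main__":
    for method in ("ols", "two_stage"):
        college = predicted_wage("college", method)
        no_college = predicted_wage("no_college", method)
        print(f"{method}: college ${college:.2f}, w/o college ${no_college:.2f}, "
              f"difference ${college - no_college:.2f}")
```

Under these assumptions the OLS differential is essentially zero while the two stage differential is about $.64, which is the contrast the table reports.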


Table 2a

PROBIT EQUATION TO EXPLAIN THE DECISION OF
WHETHER OR NOT TO GO TO COLLEGE

Variable            Coeff. Est.     S.E.

Const.              -5.259          2.108
Rural (Dummy)        -.164           .093
Split                -.078           .145
MATH                  .015           .001
MOCWAC                .922           .589
EDMAL                 .044           .017
EDFEM                 .02?           .019
CHIL                 -.057           .021


Table 2c

OLS ESTIMATES OF EARNINGS EQUATIONS

Coefficients and (in parentheses) standard errors

Variable            Eq. 3.1      Eq. 3.2      Eq. 3.3      Eq. 3.4      Eq. 3.5

Married              .1334        .1537        .1553        .2671        .0991
(Dummy)             (.0264)      (.0260)      (.0261)      (.0481)      (.0306)

Rural               -.0752       -.0748       -.0717       -.1457       -.0260
(Dummy)             (.0260)      (.0257)      (.0257)      (.0401)      (.0333)

MATH                 .0019        .0013       -.0012        .0010        .0014
                    (.0003)      (.0003)      (.0003)      (.0005)      (.0004)

HC                  -.0115       -.0084       -.0090        .0089       -.0228
                    (.0052)      (.0052)      (.0052)      (.0079)      (.0069)

S                                 .0437        .0321                     .0406
                                 (.0072)      (.0051)                   (.0074)

College Ed.          .0749       -.0812
(Dummy)             (.0256)      (.0361)

Constant            5.803        5.851        5.833        5.762        5.825

R-squared            .09466       .11769       .11507       .09272       .10411
S.E.                 .1925        .3875        .38804       .3927        .3804
# of obs.           1373         1373         1373          464          909

In Eq. 3.1 years of college is omitted. Only the college education dummy is used.
In Eq. 3.2 both years of college and the college education dummy are used.
In Eq. 3.3 only years of college is used. The college dummy is dropped.
Eq. 3.4 is a separate earnings equation for those with no college education.
Eq. 3.5 is a separate earnings equation for those with college education.

Table 2d

ML Estimates of the Complete Model

I. Years of College Education S

VARIABLES           Tobit ML               ML Estimation

Rural               same as estimates      -0.2731  (0.1069)
Split               in table 2b            -0.4095  (0.3038)
MATH                                        0.0421  (0.0023)
MOCWAC                                      2.3930  (0.0572)
EDMAL                                       0.0914  (0.0305)
EDFEM                                       0.1140  (0.0309)
CHIL                                       -0.1912  (0.0449)
Constant                                  -13.1150  (3.0066)

II. Earnings of Individuals with College Education

VARIABLES           Two Stage Consistent   ML Estimation
                    Estimation

Marital Status       0.1029  (0.0314)       0.1013  (0.0324)
Rural               -0.0272  (0.0349)      -0.0174  (0.0405)
MATH                 0.0013  (0.0009)       0.0012  (0.0000)
HC                  -0.0300  (0.0070)      -0.0.3 ;n  (0.0066)
S                    0.0466  (0.0333)       0.037(3  (0.0177)
Constant             5.(121 1  (0.1710)     5.0051  (0.0049)

Table 2d (Continued)

III. Earnings of Individuals with NO College Education

VARIABLES           Two Stage Consistent   ML Estimation
                    Estimation

Marital Status        .2719  (.0475)        0.2696  (0.0454)
Rural                -.1088  (.0472)       -0.1096  (0.0454)
MATH                 -.0022  (.0014)       -0.0022  (0.0006)
HC                    .0097  (.0078)        0.0101  (0.0082)
Constant             5.7483  (.0719)        5.7384  (0.0741)

IV. Error Variances

VARIABLES           Consistent             ML Estimation
                    Estimation

$\sigma_1^2$               7.6425                 7.3790  (0.4974)
$\sigma_2^2$               0.1495                 0.1509  (0.00008)
$\sigma_3^2$               0.1519                 0.2058  (0.0021)
$\sigma_{1\varepsilon}$             -0.00259                0.0022  (0.1375)
$\sigma_{2\varepsilon}$              0.7979                 0.7962  (0.0459)

Likelihood value                           3198.72

Table 3a

Number of Observations for Young Women
(Parnes Data)

                    $I_c = 1$     $I_c = 0$     Total

$I_p = 1$              550         620        1170
$I_p = 0$              270         759        1029

Total                820        1379        2199


Table 3B: Probit Estimates of College and Work
Decision Functions for Young Women
(Parnes Data)


                           College Decision Estimates*    Labor Force Participation Estimates*
Variables                  Dep. Var. = 1 if college       Dep. Var. = 1 if work
                                     = 0 w/o college                = 0 not work

DBLACK                       .6243   (6.995)                .2268   (2.70)
Age                                                        -.0322   (3.00)
IQ                           .0348   (13.92)                .00989  (4.61)
DMarried                                                   -.4268   (5.93)
DNE                         -.2121   (2.17)                -.1748   (1.83)
DNC                         -.2427   (2.65)                -.0789   (.89)
DSOUTH                       .0066   (.07)                  .099    (1.13)
EDFEM                        .1848   (14.44)
DCHIL                                                      -.9883   (16.28)
Constant                   -5.9475   (19.93)                .7723   (2.07)

# of obs.                    2199                           2199
# of 1's                      820                           1170
# of 0's                     1379                           1029

-2 x log (Likelihood Ratio)  605.44                         483.63
DF (Degrees of Freedom)         6                              8

* t-values are in parentheses
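As an illustration of how the probit coefficients in Table 3B translate into a probability, the sketch below evaluates the standard normal CDF at the fitted index for the college decision. The profile used (white, IQ of 100, living in the West, mother with 12 years of education) is a hypothetical example and is not taken from the paper; the function and variable names are likewise illustrative.

```python
import math

def std_normal_cdf(z: float) -> float:
    """Standard normal CDF, Phi(z), computed from the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# College-decision probit coefficients as printed in Table 3B.
COLLEGE_PROBIT = {
    "Constant": -5.9475,
    "DBLACK": 0.6243,
    "IQ": 0.0348,
    "DNE": -0.2121,
    "DNC": -0.2427,
    "DSOUTH": 0.0066,
    "EDFEM": 0.1848,
}

def college_probability(x: dict) -> float:
    """P(college = 1 | x) = Phi(constant + sum of coefficient * regressor)."""
    index = COLLEGE_PROBIT["Constant"]
    index += sum(COLLEGE_PROBIT[name] * value for name, value in x.items())
    return std_normal_cdf(index)

# Hypothetical profile (an assumption for illustration only).
profile = {"DBLACK": 0, "IQ": 100, "DNE": 0, "DNC": 0, "DSOUTH": 0, "EDFEM": 12}
p = college_probability(profile)
print(f"Predicted probability of college: {p:.3f}")
```

The same function could be applied to the labor force participation column by swapping in that column's coefficients.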


Table 3C: OLS and Two Stage Regression Results*
for Young Women (Parnes Data)


                               College Estimates*               w/o College Estimates*
VARIABLES                      TWO STAGE        OLS             TWO STAGE        OLS

DBLACK                          .1536            .1587           -.0182            .0170
                               (3.29)           (3.43)           (.43)            (.43)
AGE                             .0437            .0355            .0139            .0116
                               (6.73)           (5.85)           (2.56)           (2.17)
IQ                              .0043            .00387           .00158           .0049
                               (2.26)           (2.97)           (.88)            (4.10)
DMARRIED                        .0339           -.0414            .0142           -.0153
                               (.87)            (1.29)           (.38)            (.48)
DNE                             .0614            .0559            .0223            .00689
                               (1.21)           (1.11)           (.43)            (.13)
DNC                            -.00996          -.0022           -.0476           -.0705
                               (.21)            (.04)            (.99)            (1.50)
DSOUTH                         -.0641           -.0478           -.1529           -.1561
                               (1.41)           (1.04)           (3.28)           (3.341)
-f(B_c'X_c)/F(B_c'X_c)         -.0730
                               (1.33)
f(B_c'X_c)/(1 - F(B_c'X_c))                                       .1406
                                                                 (2.10)
-f(B_w'X_w)/F(B_w'X_w)          .2143                             .0831
                               (3.31)                            (1.58)
CONSTANT                       4.3449           4.581            5.2488           5.0127

# of Obs.                       550              550              620              620
Std. Error                      .3592            .36285           .3595            .36102
\bar{R}^2                       .10345           .08514           .07324           .0655

* Dependent Variable = ln (wage x 100)

** t-values are in parentheses. For the two stage estimates the t-values overstate their true asymptotic t's.
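The selectivity regressors that appear in Table 3C are ratios of the standard normal density f to the distribution function F, evaluated at a fitted probit index. A minimal sketch of how such terms could be computed is given below; the function names are illustrative, and the index values passed in would come from a first-stage probit such as the one in Table 3B.

```python
import math

def normal_pdf(z: float) -> float:
    """Standard normal density f(z)."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)

def normal_cdf(z: float) -> float:
    """Standard normal distribution function F(z)."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def selectivity_terms(index: float) -> tuple:
    """Return (-f/F, f/(1-F)) evaluated at a fitted probit index B'X.

    The first term is the form listed for observations with the
    indicator equal to 1, the second for observations with it equal to 0.
    """
    f, F = normal_pdf(index), normal_cdf(index)
    return -f / F, f / (1.0 - F)

# Example at a few index values (arbitrary points, not estimates).
for z in (-1.0, 0.0, 1.0):
    chosen, not_chosen = selectivity_terms(z)
    print(f"index={z:+.1f}  -f/F={chosen:+.4f}  f/(1-F)={not_chosen:+.4f}")
```

In a two stage procedure these terms would be appended as extra regressors in the second-stage OLS earnings equations.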
Table 3D: Mean Wages for Young Women (Parnes Data)

                 Raw Data      Predicted from OLS Estimates*    Predicted from Two Stage Estimates*

College           $3.97                  $3.62                            $3.69
w/o College       $3.24                  $3.32                            $3.20

Difference       Δ = $.73              Δ = $.30                         Δ = $.49

* Predictions are based on Age = 26, IQ = 100, Single, West and White.
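The predicted wages in Table 3D can be reproduced from the Table 3C coefficients. The sketch below assumes that all dummy variables are zero for the reference person (single, West, white), that the dependent variable is ln(wage x 100), and that the selectivity terms are left out of the two stage prediction; the last point is an assumption, adopted because it matches the printed figures. The two stage AGE and IQ coefficients for the college equation are read as .0437 and .0043.

```python
import math

# Constant, AGE and IQ coefficients as read from Table 3C.
COEFS = {
    ("College", "OLS"): {"const": 4.581, "AGE": 0.0355, "IQ": 0.00387},
    ("w/o College", "OLS"): {"const": 5.0127, "AGE": 0.0116, "IQ": 0.0049},
    ("College", "Two Stage"): {"const": 4.3449, "AGE": 0.0437, "IQ": 0.0043},
    ("w/o College", "Two Stage"): {"const": 5.2488, "AGE": 0.0139, "IQ": 0.00158},
}

def predicted_wage(c: dict, age: float = 26, iq: float = 100) -> float:
    """Hourly wage in dollars implied by ln(wage x 100) = const + b_age*age + b_iq*iq."""
    log_wage_x100 = c["const"] + c["AGE"] * age + c["IQ"] * iq
    return math.exp(log_wage_x100) / 100.0

wages = {key: round(predicted_wage(c), 2) for key, c in COEFS.items()}
gaps = {
    method: round(wages[("College", method)] - wages[("w/o College", method)], 2)
    for method in ("OLS", "Two Stage")
}

for key, wage in wages.items():
    print(key, f"${wage:.2f}")
print("College wage differential:", gaps)
```

Under these assumptions the differential is larger with the two stage estimates than with OLS, which is the comparison the table is drawing.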


TABLE 4a


NUMBER OF OBSERVATIONS, BY
TYPE OF MILITARY TRAINING AND CIVILIAN
OCCUPATION

                              Military Training

Civilian Occupation      YM = 1     YM = 0     Total

YC = 1                     806        867       1673
YC = 0                    4124       6144     10,268

Total                     4930       7011     11,941

TABLE 4b


AVERAGE YEARLY EARNINGS AND STANDARD DEVIATION OF EARNINGS 1970-74
BY TYPE OF MILITARY TRAINING AND CIVILIAN OCCUPATION*

                              Military Training

Civilian Occupation      YM = 1       YM = 0       Marginal

YC = 1                    9400         8958         9171
                         (2919)       (2925)       (2930)
YC = 0                    8352         8775         8605
                         (2933)       (3411)       (3234)

Marginal                  8524         8798
                         (2956)       (3355)

* Standard deviations in parentheses.
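The marginal means in Table 4b are count-weighted averages of the cell means, using the cell counts of Table 4a. The sketch below checks this, reading the YM = 0, YC = 0 count as 6144 (the figure that is consistent with the printed totals and with Table 4g) and the YM = 1 entries with a leading "0" as 8352 and 8524. These readings are assumptions about garbled digits, and the names are illustrative.

```python
# Cell counts (Table 4a) and cell means (Table 4b), keyed by (YC, YM).
counts = {(1, 1): 806, (1, 0): 867, (0, 1): 4124, (0, 0): 6144}
means = {(1, 1): 9400, (1, 0): 8958, (0, 1): 8352, (0, 0): 8775}

def weighted_mean(cells) -> float:
    """Count-weighted mean of average earnings over the given cells."""
    total_n = sum(counts[c] for c in cells)
    return sum(counts[c] * means[c] for c in cells) / total_n

row_yc1 = weighted_mean([(1, 1), (1, 0)])   # civilian electronics job
row_yc0 = weighted_mean([(0, 1), (0, 0)])   # other civilian job
col_ym1 = weighted_mean([(1, 1), (0, 1)])   # military electronics training
col_ym0 = weighted_mean([(1, 0), (0, 0)])   # no military electronics training
overall = weighted_mean(list(counts))

print(f"YC=1 marginal: {row_yc1:.1f}")
print(f"YC=0 marginal: {row_yc0:.1f}")
print(f"YM=1 marginal: {col_ym1:.1f}")
print(f"YM=0 marginal: {col_ym0:.1f}")
print(f"Overall mean:  {overall:.1f}")
```

Small discrepancies of about one dollar relative to the printed marginals are to be expected, because the cell means are themselves rounded.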


TABLE 4c


OLS EARNINGS REGRESSION WITH DUMMY VARIABLES*


Variable                      Coefficients

Intercept                      1106.64
AFQT                             13.62   (10.57)
ED                              522.11   (25.12)
White                           586.43   (4.72)
D1 (YM=1; YC=1)                 695.19   (6.06)
D2 (YM=0; YC=1)                 368.47   (3.34)
D3 (YM=1; YC=0)                -109.25   (1.76)

RSQ                             .10512
Std. Error                     3026.90
No. of Observations            11941
Mean of Dependent Variable     8684.77


* T-values in parentheses
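To show how the dummy-variable regression in Table 4c is read, the sketch below computes predicted earnings for each training and occupation group. It assumes the omitted base group is YM=0; YC=0, which is inferred from the fact that D1 to D3 cover the other three combinations. The person used (AFQT of 50, 12 years of education, white) is hypothetical.

```python
# Coefficients as printed in Table 4c.
COEF = {"Intercept": 1106.64, "AFQT": 13.62, "ED": 522.11, "White": 586.43}

# Intercept shifts keyed by (YM, YC); the base group (0, 0) is assumed omitted.
GROUP_SHIFT = {(1, 1): 695.19, (0, 1): 368.47, (1, 0): -109.25, (0, 0): 0.0}

def predicted_earnings(afqt: float, ed: float, white: int, ym: int, yc: int) -> float:
    """Predicted annual earnings from the dummy-variable OLS regression."""
    base = (COEF["Intercept"] + COEF["AFQT"] * afqt
            + COEF["ED"] * ed + COEF["White"] * white)
    return base + GROUP_SHIFT[(ym, yc)]

# Hypothetical individual (illustrative values only).
person = {"afqt": 50, "ed": 12, "white": 1}
predictions = {g: predicted_earnings(ym=g[0], yc=g[1], **person) for g in GROUP_SHIFT}

for (ym, yc), value in predictions.items():
    print(f"YM={ym}; YC={yc}: {value:.2f}")

# Difference between trained and untrained among those in electronics jobs.
training_gap_in_electronics = predictions[(1, 1)] - predictions[(0, 1)]
print(f"D1 - D2 = {training_gap_in_electronics:.2f}")
```

Because this specification only shifts the intercept, the gap between groups is the same for every individual; Table 4d relaxes this by estimating a separate equation for each group.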


TABLE 4d

OLS EARNINGS EQUATIONS*

Variable      YM=1;YC=1         YM=0;YC=1         YM=1;YC=0         YM=0;YC=0

Intercept     2386.76           3404.63           2777.55            184.16
AFQT            21.19             23.17             10.91             13.80
               (4.51)            (5.34)            (5.28)            (7.28)
ED             427.29            297.71            404.73            587.21
               (3.93)            (3.40)           (10.63)           (21.56)
White          650.93            793.74            301.37            709.89
               (1.01)            (1.62)            (1.45)            (4.17)
TABLE 4e

PROBIT ANALYSIS ON THE DEPENDENT VARIABLE YM


Variable          Coefficient     t-value

Intercept           -.4617          6.38
Army                -.1343          2.80
Navy                 .0852          1.65
Marine Corps        -.6753          7.78
Enlistee             .4306         14.20
Race                 .1118          2.09
Ed < 11              .2193          5.50
Ed = 11             -.0269           .45
Ed > 12             -.7323         18.42
AFQT                 .0020          3.65

Number of Observations = 11941

Number trained in electronics = 4930

Number not trained in electronics = 7011

-2 x Log of Likelihood Ratio (df = 9) = 1034.132

Log of Likelihood function = -7577.5436
TABLE 4f

PROBIT ANALYSIS OF THE DEPENDENT VARIABLE YC


Variable          Coefficient     t-value

Intercept          -1.5487         14.59
E4                   .0632          1.14
E5                   .1494          2.59
E6                   .0544           .37
Army                -.1262          2.12
Navy                -.0259           .43
Marine Corps        -.2273          2.08
Enlistee            -.0432          1.12
Race                 .1623          2.20
Ed < 11              .0201           .39
Ed = 11             -.0063           .08
Ed > 12             -.5126         10.18
AFQT                 .0065          9.50
YM                   .1038          3.43

Number of Observations                          = 11941
Number taking civilian jobs in electronics      = 1673
Number taking non-electronics civilian jobs     = 10268
-2 x Log of likelihood ratio (df = 13)          = 248.656
Log of likelihood function                      = -4713.6279
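The likelihood ratio statistics reported in Tables 4e and 4f can be checked against the reported log likelihoods. The sketch below assumes the comparison model is a probit with an intercept only, whose maximized log likelihood depends only on the counts of ones and zeros; this is the usual convention, but it is an assumption about how the statistic was formed. The count of those trained in electronics is taken as 4930, in agreement with Table 4a.

```python
import math

def intercept_only_loglik(n_ones: int, n_zeros: int) -> float:
    """Maximized log likelihood of a binary model with only a constant term."""
    n = n_ones + n_zeros
    p = n_ones / n
    return n_ones * math.log(p) + n_zeros * math.log(1.0 - p)

def lr_statistic(loglik_full: float, n_ones: int, n_zeros: int) -> float:
    """-2 x log of the likelihood ratio (restricted versus full model)."""
    return -2.0 * (intercept_only_loglik(n_ones, n_zeros) - loglik_full)

# Table 4e: military training in electronics (YM).
lr_ym = lr_statistic(-7577.5436, n_ones=4930, n_zeros=7011)

# Table 4f: civilian job in electronics (YC).
lr_yc = lr_statistic(-4713.6279, n_ones=1673, n_zeros=10268)

print(f"YM probit: -2 log LR = {lr_ym:.3f} (reported 1034.132, df = 9)")
print(f"YC probit: -2 log LR = {lr_yc:.3f} (reported 248.656, df = 13)")
```

Each statistic would be compared with a chi-square distribution whose degrees of freedom equal the number of slope coefficients in the corresponding table.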
TABLE 4g

TWO STAGE ESTIMATES OF EARNINGS EQUATIONS*


Variable      YM=1;YC=1         YM=0;YC=1         YM=1;YC=0         YM=0;YC=0

Intercept     6071.97           7055.36           1109.68           -248.14
AFQT             4.79             11.04              5.05              5.59
                (.64)            (1.74)            (1.50)            (2.04)
Ed             422.30            377.10            367.03            613.00
               (3.63)            (3.72)            (8.90)           (16.53)
White         1161.39           1065.05            444.95            889.79
               (1.75)            (2.13)            (2.05)            (5.02)

-2020.69 

-1243.01 

(  3.53) 

(  4.83) 

JE 

X 

<  d 

— 

961.35 

-1390.30 

(  1.54) 

(  4.96) 

3037.35 

2609.92 

'  A  >  • 

(  3.05) 

(  2.70) 

3754.57 

5251.47 

.i.-)(rcv 

(  3.04) 

(  4.92) 

RSQ                     .07997           .00294           .06099           .13254
Std Error              2001.92          2003.20          2043.26          3177.61
No. of observations       806              867             4124             6144

*The t-values in parentheses are slightly biased. See Lee, Maddala & Trost (1977) for a discussion.

Also,  B  contains  B^,  and  n;  and 

contains  X c 

and  YM. 

List of Variables


E = Natural logarithm of hourly wage

Y = Annual Earnings

S = Years of college education in 1971

MATH = Score on a composite of mathematics achievement tests in 1960

SPLIT = 0 if children living with both mother and father in 1960
      = 1 otherwise

EDMAL = Years of father's education

EDFEM = Years of mother's education

RURAL = 1 if pupils in grades 9-12 came from an area primarily small town
        (under 5,000 people) or rural farm
      = 0 otherwise

MOCWAG = (mean occupational wage) log of mean earnings (as of 1960) of
         full time workers in father's occupation

HC = Number of jobs held between 1965 and 1970

DBLACK = 1 if non-white
       = 0 otherwise

DSOUTH = 1 if live in South
       = 0 otherwise

DNE = 1 if live in Northeast
    = 0 otherwise

DNC = 1 if live in North Central
    = 0 otherwise

DMARRIED = 1 if married
         = 0 otherwise

DCHIL = 1 if have children - 5 years of age
      = 0 otherwise

IQ = IQ Score

AGE = Age in years

AFQT = Armed Forces Qualifying Test (an IQ type test)

White = 1 if white
      = 0 otherwise

YM = 1 if individual receives military training in electronics
   = 0 otherwise

YC = 1 if individual takes a civilian job in electronics after the military
   = 0 otherwise

IC = 1 if individual goes to college
   = 0 otherwise

IW = 1 if individual works
   = 0 otherwise


Table 6: A Comparison of Estimated Covariances
for Models 1 to 4


Model

Col. 1*: Estimated covariance between decision equation and earnings equation in regime where I = 1

Col. 2**: Estimated covariance between decision equation and earnings equation in regime where I = 0


1 .  Model  1 : 

compare  to 


.0297 


Model  2: 

compares  to  o 


3.  Model  3: 

compares  o,  to  o 

-  It  2e 


Mode 1  3 : 

compares  o  to  o 

Itc  3cc 

compares  a  to 

2ec  o„ 

4tc 

compares  c  to  a 

1cm  2t m 


3837.35 


2089 .92 


-2028.09 


3754.57 


5251.47 


-961.35 


compares  a  to 

,  3tm  4um 


-1243.01 


-1390.30 


ltc'  ( 2ec'  ltm'  ^3£m 


u „  ,  o  „  ,  (J  ,  o. 

3f c  4tc  2em  4tm. 